Difference computation using change identification techniques for structured web documents

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Structured Parallel Computation in Structured Documents

Document archives contain large amounts of data to which sophisticated queries are applied. The size of archives and the complexity of evaluating queries makes the use of parallelism attractive. The use of semantically-based markup such as SGML makes it possible to represent documents and document archives as data types. We present a theory of trees and tree homomorphisms, modelling structured ...

متن کامل

Structured Information Retrieval for Web Documents

To overcome the limitations of conventional Web search engines in retrieving Web documents relevant to users' queries, one has to exploit semantic structures embedded in Web documents. We propose a Web Information Retrieval (WebIR) model for Web documents containing semantic elements which are text segments enclosed by special tags. These special tags, known as semantic tags, can either be inde...

متن کامل

Georeferencing Semi-Structured Place-Based Web Resources Using Machine Learning

In recent years, the shared content on the web has had significant growth. A great part of these information are publicly available in the form of semi-strunctured data. Moreover, a significant amount of these information are related to place. Such types of information refer to a location on the earth, however, they do not contain any explicit coordinates. In this research, we tried to georefer...

متن کامل

VTML for Fine-Grained Change Tracking in Editing Structured Documents

The task of creating documents collaboratively is complex and it requires sophisticated tools. Structured documents provide a semiorganised writing environment where collaboration may assume more controlled forms than with other document types. CoEd is a writing environment that provides integrated structure support, content overview and version management for complex and hierarchical documents...

متن کامل

Exploring Structured Documents and Query Formulation Techniques for Patent Retrieval

This paper presents the experiments and results of DCU in CLEF-IP 2009. Our work applied standard information retrieval (IR) techniques to patent search. Different experiments tested various methods for the patent retrieval, including query formulation, structured index, weighted fields, document filtering, and blind relevance feedback. Some methods did not show expected good retrieval effectiv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IOP Conference Series: Materials Science and Engineering

سال: 2021

ISSN: 1757-899X

DOI: 10.1088/1757-899x/1022/1/012054